Dynamic Compressed Strings with Random Access
Abstract
We consider the problem of storing a string S in dynamic compressed form while permitting operations directly on the compressed representation of S: accessing a substring of S; replacing, inserting, or deleting a symbol in S; counting the occurrences of a given symbol in any given prefix of S (the rank operation); and locating the position of the ith occurrence of a symbol in S (the select operation). We discuss the time complexity of several combinations of these operations, along with the entropy space bounds of the corresponding compressed indexes. In this way, we extend or improve the bounds of previous work by Ferragina and Venturini [TCS, 2007], Jansson et al. [ICALP, 2012], and Nekrich and Navarro [SODA, 2013].
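To make the operations named in the abstract concrete, the following is a minimal uncompressed reference model in Python. It is only an illustration of the operation semantics, not the compressed structure the paper studies: every method here takes time linear in the string length and uses no compression. The class name `NaiveDynamicString`, the 0-based positions, and the convention that `rank(c, i)` counts occurrences in the prefix of length i are all assumptions made for this sketch.

```python
class NaiveDynamicString:
    """Uncompressed reference model (hypothetical name); every operation is O(n)."""

    def __init__(self, text=""):
        self._symbols = list(text)

    def access(self, start, length):
        """Return the substring of the given length beginning at 0-based position start."""
        return "".join(self._symbols[start:start + length])

    def replace(self, i, c):
        """Overwrite the symbol at position i with c."""
        self._symbols[i] = c

    def insert(self, i, c):
        """Insert symbol c so that it ends up at position i."""
        self._symbols.insert(i, c)

    def delete(self, i):
        """Remove the symbol at position i."""
        del self._symbols[i]

    def rank(self, c, i):
        """Count occurrences of c in the prefix of length i (assumed convention)."""
        return self._symbols[:i].count(c)

    def select(self, c, k):
        """Return the 0-based position of the k-th occurrence of c (k >= 1), or -1."""
        seen = 0
        for pos, sym in enumerate(self._symbols):
            if sym == c:
                seen += 1
                if seen == k:
                    return pos
        return -1
```

Under these conventions, for the string "abracadabra", `rank("a", 5)` is 2 (the prefix "abrac" holds two a's) and `select("a", 3)` is 5. A compressed dynamic structure would aim to answer the same queries in far less time and space.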
Similar resources
Practical Dynamic Entropy-Compressed Bitvectors with Applications
Succinct/compressed data structures aim at providing the same functionality offered by classical data structures while using asymptotically less space. There exist several of these structures for a wide spectrum of applications ranging from strings over arbitrary alphabets to full-text indexing. Their theoretical promises have been met in practice in the static scenario, while the practicality ...
Optimal Time Random Access to Grammar-Compressed Strings in Small Space
The random access problem for compressed strings is to build a data structure that efficiently supports accessing the character in position i of a string given in compressed form. Given a grammar of size n compressing a string of size N, we present a data structure using $O(n\Delta \log_\Delta \frac{N}{n} \log N)$ bits of space that supports accessing position i in $O(\log_\Delta N)$ time for $\Delta \le \log^{O(1)} N$. The query time i...
Space-efficient Data Structures for Collections of Textual Data
This thesis focuses on the design of succinct and compressed data structures for collections of string-based data, specifically sequences of semi-structured documents in textual format, sets of strings, and sequences of strings. The study of such collections is motivated by a large number of applications both in theory and practice. For textual semi-structured data, we introduce the concept of ...
Algorithms and Data Structures for Grammar-Compressed Strings
This thesis presents new algorithms and data structures for handling data represented as grammar-compressed strings. The compression scheme we focus on is the Straight Line Program (SLP). In the following, S is an SLP of size n compressing a string S of size N. We consider the following problems. The q-gram profile of a compressed string. We present an algorithm for computing the q-gram profil...
Data Structures for Grammar-compressed Strings
This thesis presents new algorithms and data structures for handling data represented as grammar-compressed strings. The compression scheme we focus on is the Straight Line Program (SLP). In the following, S is an SLP of size n compressing a string S of size N. We consider the following problems. The q-gram profile of a compressed string. We present an algorithm for computing the q-gram profil...